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ABSTRACT 

Here we present the results of various approaches to measure accurate colours and 
photometric redshifts (photo-z's) from wide-field imaging data. We use data from 
the Canada-France-Hawaii- Telescope Legacy Survey (CFHTLS) which have been re- 
processed by the CFHT Lensing Survey (CFHTLenS) team in order to carry out a 
number of weak gravitational lensing studies. An emphasis is put on the correction of 
systematic effects in the photo-z's arising from the different Point Spread Functions 
(PSF) in the five optical bands. Different ways of correcting these effects are discussed 
and the resulting photo-z accuracies are quantified by comparing the photo-z's to 
large spectroscopic redshift (spec-z) data sets. Careful homogenisation of the PSF 
between bands leads to increased overall accuracy of photo-z's. The gain is particularly 
pronounced at fainter magnitudes where galaxies are smaller and flux measurements 
are affected more by PSF-effects. We discuss ways of defining more secure subsamples 
of galaxies as well as a shape- and colour-based star-galaxy separation method, and we 
present redshift distributions for different magnitude limits. We also study possible re- 
calibrations of the photometric zeropoints (ZPs) with the help of galaxies with known 
spec-z's. We find that if PSF-effects are properly taken into account, a re-calibration 
of the ZPs becomes much less important suggesting that previous such re-calibrations 
described in the literature could in fact be mostly corrections for PSF-effects rather 
than corrections for real inaccuracies in the ZPs. The implications of this finding for 
future surveys like KiDS, DES, LSST, or Euclid are mixed. On the one hand, ZP 
re-calibrations with spec-z's might not be as accurate as previously thought. On the 
other hand, careful PSF homogenisation might provide a way out and yield accurate, 
homogeneous photometry without the need for full spectroscopic coverage. This is the 
first paper in a series describing the technical aspects of CFHTLenS. 

Key vifords: galaxies: photometry, galaxies: high-redshift, galaxies: abundances 



1 INTRODUCTION 

Estimating distances of celestial objects has always 
been one of the major technical aspects in observa- 
tional astronomy. Whenever approximate redshifts of a 
very large number of faint extragalactic objects are 
needed the estimation of redshifts (and hence dis- 
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represent the only practical solution. Over the last few 
decades this technique has become increasingly important in 
extragalactic studies. Cosmological observations, inherently 
statistical in nature, particularly benefit from the availabil- 
ity of redshifts for millions of objects over large cosmological 
volumes. 

An example is weak gravitational lensing (WL; for 
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reviews see Bartelmann & Schneideil I2OOII : iMunshi et al.l 
l2008l : iHoekstra fc Jaini i2008l ). which has been established 
as an important tool to study the dark sector of the 
Universe. The first WL de t ection of a cluster of galaxies 
was made by iTvson et al.] (|l990l ). Thanks to the steady 
progress in analysis tools and better knowledge of the 
source redshift distribution WL studies have become 
an important too l to calib r ate th e masses of ind ividual 
galaxy clus t ers (iHoekstral 20071: Heymans et al.l 
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2013]) can be studied. Furthermore, the WL effect of 
the large sca le structure (LSS) of the Universe, called 



cosmic shear (Ivan \yaerbeke et al.l I2OO0I: [Kaiser et al.ll2000l 
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200^: Hoekstra et al. ' '2OO6I: iHetterscheidt ct al' |2007[ 
iSchrabback ct al. 2007, 2010; Massey et al. 2 007 ; Fu et al.1 
|2008| ). has been identified as one of the most promising 
prob e s of the effects of dark energy (DE; lAlbrecht et al.l 
I2OO6I : iPeacock et al.|[20o3 ). 

Since the lensing signals observed in these cases depend 
directly on the distances of the lenses and sources it is impor- 
tant to have an accurate knowledge of the lens-source geom- 
etry through knowing the redshifts of the objects. Modern 
WL surveys are designed in such a way that simultaneous 
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measurements of the WL observables as well as photo-z's 
are possible. In particular large, future imaging surveys that 
will cover a fair fraction of the extragalactic sky - like KiDS, 
DES, LSST, and Euclid - require extremely accurate photo- 
z's to reach a systematic accuracy in the WL measurements 
that does not compromise the survey's statistical power. Dif- 
ferent ways of achieving this goal have been discussed in the 
literature. 

In the foreseeable future all of these surveys will rely on 
ground-based multi-colour photometry, which means that 
atmospheric effects have to be corrected for. In this pa- 
per we present advanced techniques to arrive at homoge- 
neous photo-z's from inherently inhomogeneous, ground- 
based survey data. For this we use the most powerful 
WL survey to date, the Canada France Hawaii Telescope 
Legacy Survey (CFHTLS). Being a ground-based survey, 
the CFHTLS involves some unavoidable inhomogeneities, 
e.g. in terms of seeing, atmospheric extinction, etc. The 
CFHT Lensing Survey (CFHTLen^ team was formed to 
provide a reduction and analysis of the CFHTLS data opti- 
mised for WL science and addressing these challenges. The 
higher-level requirements on the data to measure accurate 
shapes and redshifts of tens of millions of galaxies made this 
'lensing-quality' reduction necessary. 

In this first paper of a series we present the multi-colour 
photometry and the resulting photo-z's upon which the fu- 
ture CFHTLenS science projects will be based. In Sect. [2] 
the CFHTLS data set and the CFHTLenS data reduction 
are presented and compared to the public data available 
from the TERAPIX team. Section [3] deals with the crucially 
important correction for atmospheric effects needed for ac- 
curate multi-colour photometry. In Sect. [4] our strategy to 
extract catalogues from the images in five different photo- 
metric bands is described. The photo-z estimation is then 
presented in Sect.[5]along with the results of the comparisons 
between photo-z's and spectroscopic redshifts. Conclusions 
are presented in Sect. [G] 



2 DATA SET 

2.1 The CFHTLS- Wide 

The Wide component of the Canada-France-Hawaii Tele- 
scope Legacy Survey (CFHTLS- Wide) commenced in mid- 
2003 and completed observations in early 2009. In more 
than 2300 hours of dark and grey time over these 5 and 
a half years, the CFHTLS- Wide imaged 172 one square de- 
gree MEGACAM fields in five filters u, g, r, i, z to a 5a point 
source limiting magnitude of jab ~ 25.5. The data span four 
contiguous fields; Wl(~ 63.8 sq. deg.), W2(~ 22.6 sq. deg.), 
W3(~ 44.2 sq. deg.) and W4(~ 23.3 sq. deg.) totalling 154 
sq. deg. once the overlap regions are accounted for. Wl, W2 
and W4 are equatorial fields with Wl and W4 containing the 
VVDS and VIPERS spectroscopic surveys. W3 is a northern 
field containing the extended Groth Strip DEEP2 spectro- 
scopic survey. A detailed report of the full CFHTLS Deep 
and Wide surveys can b e found in the TERAP IX CFHTLS 
T0006 release document (jCoranova et al]l2009l l. 

The CFHTLS- Wide was optimised for the study of weak 



gravitational lensing for which the crucial observables are 
the shape of resolved galaxies as well as their redshifts. The 
observing strategy was therefore to reserve the best seeing 
conditions with 6 < O'.'S for the lensing i-band filter and 
follow-up with the other bands in the poorer seeing condi- 
tions. That is also the reason why the i-band is our primary 
object detection band (see Sect.|4]). 



2.2 The CFHTLenS data reduction 
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marise the most important differences in data pro c essing 
between t he CARS pro ject, detailed in lErben et al.l (|2009l ) 
and Hild ebrandt et al.l |2009), and the current CFHTLenS 
data set. A more detailed description will be given in another 
publication of this series. 

The CFHTLenS project makes use of the complete 
CFHTLS- Wide data set. This includes five-colour coverage 
of 172 square degrees of high-quality data subdivided in four 
patches W1-W4 (see Sect. 12. ip . In addition we make use 
of the CFHTLS Pre-Survey which densely covers the com- 
plete survey area with shortly exposed r-band images. This 
Pre-Survey was acquired to optimise the astrometric cali- 
bration for the main science data. Similarly, to improve the 
photometric calibration of CFHTLS, the survey area was 
(re-)observed in a sparse grid under photometric conditions 
during the year 2008 (CFHT program RunlDs 08AL99 and 
08BL99). 

All CFHT MEGACAM images are initially processed 
using the Elixir software at the Canadian Astronomical 
Data Centre (CADC) and it is this archived data that we 
use in the CFHTLenS project. The current work and all 
other CFHTLenS publications use the CFHTLS- Wide im- 
ages, the astrometric Pre-Survey data and additional pho- 
tometric data that were available at CADC on January 15, 
2009. In total, the set contains 7997 Elixir processed CFHT 
MEGACAM images. 

While we processed the data on a per-pointing level in 
the CARS project, we performed all calibrations on a per- 
patch level for CFHTLenS. The inclusion of all available 
data, especially the astrometric Pre-Survey and the pho- 
tometric (re-)calibration, significantly improved the homo- 
geneity of our data. Most important for this work is our im - 
provement in photometric calibration. In lErben et al.l (j2009l ) 
we quoted the RMS uncertainty of our relative photomet- 
ric calibration between fields as Uabs,g'r'i' ~ 0.01 — 0.04 
mag, aabs,z' ~ 0.03 — 0.05 mag and aabs,u* ~ 0.15 mag. In 
CFHTLenS we now reach a « 0.01 — 0.03 mag in all pass- 
bands. Figure [1] compares magnitudes in the CFHTLenS 
field Wlplml with the magnitudes taken from the fifth 
data release of the Sloan Digit al Sky Survey (SDSS-DR5; 
lAdelman-McCarthv et al]l2007l ). For comparison the plot 
also shows this comp arison for the corre sponding CARS field 
(see also Fig. A.7 of lErben et"alll2009l '). 
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Figure 1. Photo metric compa r ison between SDSS-DR5 and 
CARS (top; see lErben et al] l2009t) and SDSS-DR5 and 
CFHTLenS (bottom) in the five different bands in one field of 
1 sq. deg. The solid horizontal lines represent the average dif- 
ference over this magnitude interval. 



2.3 The TERAPIX T0006 catalogue 

The first reduction of the CFHTLS-Wide data was per- 
formed by TERAPIXQ. Tiie data were distributed in several 
public releases. The most recent releas e, T0006 (Novem- 
ber 15, 2010; iGoranova et al.l |2009| ). includes photo- 
z's which were e s timat ed in the same way as described 
20091 ) for the T0004 release (see also 



1 



in ICoupon et al . 

ICoupon et al.l 120111 . for more details on the T0006 photo- 
z's) with the Le Phare photo-z code l|Arnouts et al.l [20021 ; 
lllbert et al.ll200^ . However, the photometric calibration im- 
proved significantly in the latest release and now covers the 
whole CFHTLS-Wide area. We will compare the photo-^'s 
from the T0006 catalogues to CFHTLenS in the following 
sections. 

The most notable differences in the multi-colour pho- 
tometry compared to CFHTLenS are: 

(i) The T0006 images are median stacks whereas th e 
CFHTLenS images are mean stacks l|Erben et al. I l2009l ). 
While median stacks allow for easier rejection of image de- 
fects (cosmic rays, reflections, asteroids, etc.) mean stacks 
are less noisy and hence lead to deeper images and more 
precise photometrylf] 

(ii) T0006 does not implement PSF homogenisation (a 



^ |kttp : //terapix . lap . f r/ 

^ A lot of effort was invested to automatically mask image defects 
in the CFHTLenS data reduction prior to the stacking. This leads 
to very clean images which can be averaged directly superseding 
the more noisy median (or also trimmed-mean) procedures. The 
standard error of the median is ^ 25 per cent larger than the 
standard error of the mean, if the noise distribution is Gaussian, 
leading to a loss of 0.24mag of depth. 



procedure correcting for the different seeings in the different 
bands as well as at different positions on the image described 
in Sect. (31) before catalogue extraction. 

(iii) T000 6 measures magnitudes in elliptical apertures 
jKronl 119801 ) instead of isophotal apertures, which we use 
for CFHTLenS (see Sect.©. 

(iv) The photometric zeropoints were re-calibrated with 
the help of a large sample of spectroscopic redshifts. See 
Sect. l5.3l for a detailed discussion of the benefits and dangers 
of such a re-calibration. 



3 PSF HOMOGENISATION 

For photo-z's we are particularly interested in measuring ac- 
curate colours of objects. For point sources this is a clearly 
defined problem. For extended objects, however, there is no 
unique definition of a colour and it depends on circumstances 
as to which definition is the most useful. For our purpose, to 
use the colours to estimate photo-z's (see Sect.[S| we need 
colours that best match those modelled from SED templates. 
As we explain below this requires identical physical aper- 
turefl on the sky in the different bands. An extreme choice 
would be to define colour as the difference in total magni- 
tude of an object in different bands; but since objects can 
have very different physical extent in different bands this ap- 
proach could lead to vastly different colours compared to the 
matched apertures, and is not optimal in a terms of signal 
to noise. 

When estimating photo-z's we compare the observed 
colours of an object to colours modelled from the convo- 
lution of SED templates with instrument response curves. 
These SED templates are either based on observations (em- 
pirical templates) or on synthetic stellar evolution models 
(synthetic templates). Both approaches yield model colours 
for a particular stellar population, and it is therefore impor- 
tant that our observations yield colours that correspond to 
the same set of stars in each band, i.e., that they represent 
the same physical aperture on the galaxy. 

Matching the physical apertures is complicated by the 
fact that different images taken through different filter bands 
will invariably show different degrees of blurring, expressed 
as the point-spread function (PSF) and measurable as the 
observed shape of the stellar images in the field. These PSF 
differences can arise from different ambient conditions dur- 
ing the observations, or from chromatic effects in the at- 
mosphere and optics of the telescope. Regardless of their 
origin they have the effect that identical apertures in differ- 
ent images of the same part of the sky do not represent the 
same physical part of a source. Not only different PSF sizes 
(seeing) , but also the more subtle effects of PSF anisotropy, 
especially with a prime-focus camera like MEGACAM, will 
affect the photometry. Compensating for the PSF differences 
between bands can be done in two ways; either by adapt- 
ing the apertures to compensate for the different PSFs, or 
by manipulating the images so as to arrive at images with 
the same PSFs. Here we will follow the latter approach, be- 
cause it is independent of the object size, and leads to more 

Here the term 'physical' refers to apertures which cover the 
same physical parts of an object in different bands, i.e. the same 
area on the sky before blurring by the atmosphere. 
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homogeneous photometry across the survey. This involves 
constructing a suitable pixel convolution kernel for each im- 
age. 

3.1 Constant Gaussian convolution kernel: global 

In this method we assume for simplicity that the PSF can 
be described by a single Gaussian with width crpsp. Under 
this assumption one can convolve an image in band X with 
a 2-dimensional Gaussian filter function of width 

CTflltcr.X = y^C^PSF.worst ~ '^PSF.X ' (1) 

to arrive at an image with a PSF size which matches 
the PSF size of the image with the worst seeing in a given 
set, CTpsF, worst- This method implies that the PSF size does 
not change with position on an image (or at least that the 
variation with position is the same in all bands an assump- 
tion that is not necessarily true for contemporary wide-field 
imaging cameras like MEGACAM on the CFHT. In the fol- 
lowing we will call this approach gl obal PSF homogeni s ation . 
It is t he sa. me a pproach as used in i Hildebrandt et al.l l|2006l . 
12007 *. '2009') and lErben et all (|2009l ) and is computationallv 
straightforward. 

3.2 Gaussianisation of the PSF with a spatially 
varying kernel: local 

In the local approach we drop the assumptions that the ini- 
tial PSF is Gaussian and position-independent, and con- 
struct a convolution kernel that is designed to make the 
PSF Gaussian everywhere, with the same width. Obviously 
this requires a non-Gaussian convolution kernel that changes 
with position on the image. Here we model the PSF and 
convolution kernels using the shapelet formali sm (|Refregieij 
I2OO3I : iRefregier fc BaconllioO^ : iKuiikenI lioO^ )— essentiallv. 
each source is described as a sum of two-dimensional Gauss- 
Hermite functions. This method is clo sely rela t ed to the 
GaaP photometric package described in iKuiikenI (|2008l ) . 

First, a catalogue is extract ed from the i-band im - 
age with the SExtractor software (|Bertin fc Arnout3ll996h . 
Stars are selected in a magnitude vs. size diagram from the 
i-band data. In each of the five bands, the images of these 
stars are then modelled by sums of shapelets up to tenth 
order, through least-squares fitting of the pixel values. The 
size of the PSF, which is needed to scale the shapelets, is 
also taken from the initial SExtractor catalogue. 

Next two-dimensional, fifth-order polynomials are fitted 
to the coefficients of the shapelet expansion to describe the 
variation of the PSF over the MEGACAM field in each band. 
From these high-order analytic descriptions of the PSF vari- 
ations in each of the five bands, the convolution kernel is 
then created. The goal is to produce five images which have 
exactly the same Gaussian PSF over the whole field. To 
avoid deconvolution and the associated noise amplification, 
the target size for the final Gaussian PSF is chosen to match 
the largest PSF size found in any position in any of the five 

^ However, it should be noted that PSF-effects can have a collec- 
tive effect changing the average properties of galaxies as a function 
of position if they are not corrected for. We defer an analysis of 
these PSF-photo-2 correlations to a forthcoming paper. 



bands of a field. Within the shapelet formalism, it is easy to 
calculate a convolution kernel that transforms the modelled 
PSF into the Gaussian target PSF at each position in each 
band, and to perform the convolution in Fourier space. More 
details can be found in appendix 1X1 

In the following we will call this approach local PSF 
Gaussianisation. 

3.3 General remarks 

It is clear that the benefits of PSF homogenisation are most 
pronounced if the changes in PSF size (or also PSF shape) 
are large between bands. We will compare both the global 
and the local approaches to the case where no PSF ho- 
mogenisation is performed. This latter case will be referred 
to with the label none. See Table [1] for a summary of the 
different schemes to homogenise the PSF. The photo-z's in 
the T0006 catalogues (see Sect. [231 1 which were estimated 
from multi-colour photometry on median stacks that had 
not been corrected for PSF-effects, will be referred to by the 
label T0006. 



4 CATALOGUE EXTRACTION 

Multi-colour catalogues are extracted from a set of PSF ho- 
mogenised images in the ugriz filters using SExtractor in 
dual-ima ge mode. The proce dure is identical to the one pre- 
sented in lErben et al.l (|2009l ) involving six SExtractor runs. 
The unconvolved (i.e. non-PSF-homogenised) i-band image 
is used as the detection image in all six runs. Five runs are 
performed with the convolved (i.e. PSF-homogenised) ugriz 
images. One additional run with the unconvolved i-band im- 
age as detection as well as measurement image is performed 
to estimate total i-band magnitudes. This latter run is nec- 
essary because the isophotal apertures, which are used for 
fiux measurements and hence colour estimation, are defined 
on the detection image by SExtractor. Using a measure- 
ment image with a larger PSF leads to fiux leaking outside 
the aperture and therefore underestimated fiuxes (overes- 
timated magnitudes). Those biased fiuxes are optimal for 
colour measurements because they correspond to the same 
physical parts of an object (see Sect. [3] above) but cannot 
readily be used to estimate total magnitudes. However, with 
the total i-band magnitudes estimated reliably in the sixth 
SExtractor run and the accurate colour measurements it is 
still possible to arrive at total magnitude estimates in all 
bands, e.g. for a band X: 

Xtot = itot + {X-i), (2) 

where {X — i) is the corresponding colour index. 

The multi-colour photometry is complemented wit h the 
following quantities (for details see lErben et al.ll2009l ): 

• Limiting magnitudes estimated from the local sky- 
background around an object. 

• E xtinction values extracted from the ISchlegel et al.l 
l|l998l ) maps at the object's position. 

• Masks based on algorithms to reject regions of low 
signal-to-noise ratio (S/N), halos and diffraction spikes of 
bright stars, and asteroids. These automated masks are then 
inspected by eye and further modified if necessary by people 
in the survey team. 
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Table 1. Different schemes to homogenise the PSF and their properties. Note that the PSF size at a random position in the global scheme 
is only approximately constant between bands because the intrinsic PSF size changes from the centre to the edge of the MEGACAM 
mosaic. Using a constant convolution kernel for PSF homogenisation between bands will only correct the PSF in one position (here 
chosen to be the centre of the image) and leave residuals at other positions. However, as we show in Sect. 15.21 these residuals are small 
enough to make the photo-^'s based on the global photometry superior to the ones based on the none or T0006 photometry (but not as 
accurate as the ones based on the local photometry). 



Scheme 


PSF size constant 
between all 5 images 
in image centres 


PSF size constant 
between all 5 images 
at any other position 


PSF size & shape constant 
independent of position 
in all 5 images 


PSF shape Gaussian 
in all 5 images 


none/T0006 


X 


X 


X 


X 


global 


V 


approximately 


X 


X 


local 


^/ 






^/ 



4.1 Photometric quality control and creation of 
the mosaic catalogue 

Before photometric redshifts are estimated (see Sect.lS} sev- 
eral tests are performed with the multi-colour photometry 
alone to ensure the integrity of the data. The surface den- 
sities of objects in a fixed magnitude range are analysed 
and the object magnitude number counts are inspected on a 
field-by-field basis. The sky distributions of the same objects 
are plotted and eye-balled and distributions of quantities like 
the half-light-radius or the posit ion- angle of galaxies for each 
field are plotted as well as colour-colour diagrams of stars 
(selected by size and magnitude). 

In order to arrive at a homogeneous mosaic catalogue 
we define hard cuts in right ascension and declination for 
each field. As the boundary between two fields we choose 
the mean of the extremal positions of objects taken from 
one field and from a neighbouring field. This method en- 
sures that no celestial object appears more than once in our 
mosaic cataloguelfl 



5 PHOTOMETRIC REDSHIFTS 



Photo-2;'s are estimated with the BPZ code (|Bem'tezll2OO0l : 
Coe et al.ll2006l) on e of t he most-widely used photo-z codes. 
Hildebrandt et al.1 (|2010l ) tested several codes against simu- 
lated and real data showing that BPZ is amongst the most 
accurate codes when combined with the best available SED 
templates. 

In the photo-« estimation process we properly take ac- 
count of objects that are not detected in one or more of the 
ugrz-hanAs. (identified by magnitude estimates fainter than 
the limiting magnitudes, which can ha ppen when SExtrac - 
tor is run in dual-ima ge mode). As in lErben etHI (|2009l ) 
and [Hildebrandt et al. [j2009l ) we use the re-calibrated tem- 
plate set of lCapakI (|2004l ) II Compared to lErben et~all (|2009l ) 
we implemented some changes to improve the low-redshift 
behaviour of the photo-z's that are explained in the follow- 
ing. 



° Note that there could still be objects which appear in two neigh- 
bouring fields because of astrometric errors. But their number is 
so low that we do not account for this here. 

^ This template set is very similar to the one used for the T0006 
catalogues. 



5.1 Modifications of the prior 

The Bayesian approach used by BPZ encompasses the cal- 
culation of the redshift likelihood and its subsequent mul- 
tiplication with a prior to yield the posterior probability of 
an object having a certain redshift given the data. For noisy 
data the peak in the likelihood always has a finite width. For 
a 2 = object the following happens: the likelihood function, 
which extends to redshifts z > Q because of its finite width, 
is multiplied with a steep prior that behaves like P[z) ~ 2"' 
for 2 < 1, with 0.9 < at < 2.5. In particular P{z = 0) = 
so that the peak of the posterior probability distribution is 
always at 2 > 0. Since BPZ picks the peak of the posterior 
as the photo-2 estimate, this leads to a systematic over- 
estimation of photo-2's at low redshift, whenever there is an 
appreciable number of low-2 objects with limited S/N (and 
hence broad likelihoods) to make this effect visible. Similar 
biases of different sever i ty can be seen in the Bayesia n photo- 
n's of e.g. ICsabai et al.1 ll2003l : Fig. 7). lBem'tej (|2000l : Fig. 7), 
and I Coupon et ah ( 20091 : Figs. 3 & 5). This behaviour is a 
general problem of template-based, Bayesian redshifts. Here 
we present an ad-hoc solution, but emphasise that further 
research is needed on this. 

To attenuate low-redsh ift bias we run BPZ twice, first 
with the original prior from iBenitej (|2000l ) 



P{z) oc 2°"' exp 



Zmt{mo] 



and again with a modified prior 

fmod(2) OC (2"' +0.05) CXp 



Zmtimo) 



(3) 



(4) 



This modified prior no longer vanishes for 2 = 0, but lev- 
els off. Figure [2] illustrates this modification. We keep the 
result of the first run unless the most probable redshift is 
below 2 = 0.1 and the ODDS parameter (i.e. the fraction 
of the integrated probability included in the primary peak 
of the posterior probability distribution; see [Bem tcz 200^ 
associated with this solution is smaller than 0.8 indicating 
a possibly biased redshift. We found that this ad-hoc mod- 
ification yields an improved low-redshift performance when 
comparing photo-2's to spectroscopic redshifts (spec-2's; see 
next section). The choice of adding 0.05 in the first term of 
Eq. |4]yields the best results for this particular data set. But 
data with different noise properties might require different 
modifications. As mentioned above these prior modifications 
are a subject of ongoing research. It will be important to find 
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Figure 2. Rodshift prior before (solid) and after (dotted) modi- 
fication, here shown for an elliptical galaxy with i = 20. 



robust, self-consistent ways of finding the optimal modifica- 
tion, possibly based on a large, spectroscopic, low-z training 
set. 



5.2 Photo-z accuracy 

We estimate the accuracy of the CFHTLenS photo-z's by 
comparing th em to spec-z's from th e VIMQS VLT Deep Sur- 
vey (VVDS; iLe Fevre et all l2005l : [Carilh et al.lboosi ) and 
the DEEP2 galaxy redshift survey l|Davis et al.ll2007l ) which 
overlap with 20 fields of the CFHTLS-Wide. The deepest 
spectroscopic fields contain objects down to i = 24, however 
with an increasing incompleteness for fainter magnitudes. 
We also add spec-z's from the S loan Digital Sky Surv ey 
seventh data release (SDSS-DR7; lAbazaiian et al.ll2009l ) if 
available in those 20 fields. We do not add the SDSS spec- 
k's in the other 90 CFHTLS-Wide fields that overlap with 
SDSS because they do not contain significant additional in- 
formation due to their low redshift. In Fig. [3] the photo- 
z's for the three different PSF homogenisation methods are 
shown against the spec-z's. Visibly the photo-z accuracy im- 
proves with increasing sophistication of the PSF homogeni- 
sation. 

We limit the analysis to galaxies, as identified by the 
spec-z surveys. The stellar spectra will be used in Sect. 15.51 
to develop criteria for star-galaxy separation. It should be 
noted that the spec-z catalogues used here are incomplete 
for i ^ 22 and mi ght paint too positive a picture of the photo- 
z accuracy (see Hildcbrand t et al.ll200i , l20ld . for extensive 
discussions of these effects). In Sect. [6] we discuss strategies 
to acquire a complete picture of the photo-z accuracy that 
will be presented in Benjamin et al. (in prep.). 

For each object with a reliable spec-z measurement 
(quality ffags 3, 4, 23, 24 for VVDS; quality ffags 3 and 4 for 
DEEP2; quality flag 3 for SDSS) we calculate the quantity 
Az = fpi£i_^;p£i^ where Zphot is the peak of the posterior 



mean of Az and the standard deviation around this mean, 
which corresponds to the photo-z scatter. This is done af- 
ter outliers have been excluded. Furthermore we report the 
total bias of the sample, i.e. the mean of Az including the 
outliers. We would like to stress that these three numbers 
(outlier fraction, scatter, bias) are not independent of each 
other and cannot reflect the full error distribution, which is 
highly non-Gaussian. 

These statistics are calculated for different narrow i- 
band magnitude bins as well as for different narrow redshift 
bins (with a pre-selection of 19 < i < 24.5) for each of 
the three different PSF homogenisation approaches, none, 
global, and local, as well as for the T0006 catalogues. Errors 
are calculated assuming Poissonian shot noise. It should be 
noted that there is non-negligible correlation between the 
errors in neighbouring magnitude/redshift bins. 

Figure |3] shows the photo-z accuracy as a function of 
i-band magnitude and redshift for the different methods. 
The effects of the PSF homogenisation can be clearly seen. 
While the performance at bright magnitudes is similar in all 
methods the photo-z scatter and outlier rates at fainter mag- 
nitudes for the two methods without PSF homogenisation 
(none and T0006) are larger than for the methods with PSF 
homogenisation {global and local). Fainter objects are also 
smaller on average and hence their shape is more strongly 
dominated by the PSF. Not correcting for PSF-effects biases 
their colours and leads to less accurate photo-z's. 

Looking at the accuracy as a function of redshift shows 
that PSF homogenisation leads to greater accuracy over 
the whole redshift range. The effect is more pronounced at 
higher redshifts, but since there are also many faint low- 
redshift galaxies the low-z statistics for global and local are 
generally better than for none and T0006 as well. 

The global and local schemes show similar redshift ac- 
curacy. At redshifts z > 0.9 local shows somewhat reduced 
photo-z scatter, but the differences are small. This finding 
supports the hypothesis from Sect. 13.31 that although the 
PSF still varies from centre to edge in the global images 
(unlike the local images) these variations are similar in all 
bands and do not lead to strong colour biases. 

The photo-z bias is non- negligible for most magni- 
tudes and redshifts regardless of the method. With typical 
template-based photo-z methods it is very hard to suppress 
this bias without introducing larger scatter or more outliers. 
The strategy for scientific studies using such photo-z's must 
be to properly calibrate and account for this bias in the 
analysis. We do not correct for the bias at this stage. 

In Fig. [S] we present the photo-z accuracy for the dif- 
ferent SE P templates as det ermined by the photo-z code. 
Similar to lllbert etall (120061 ) we find that the accuracy for 
elliptical galaxies and spiral galaxies is very similar. Only 
actively star-forming galaxies show a degraded photo-z ac- 
curacy. It is obvious that the accuracy for ellipticals/spirals 
suffers considerably once the 4000 A-/Balmer- break starts to 
leave the filter set at z ~ 1.1/1.3. 



probability distribution. Objects with |Az| > 0.15 are re- 
garded as outlierslfl For a given sample we then calculate the 



° This choice is arbitrary and is mainly used for historical reasons 
since many photo-z studies in the past have adopted the same 



definition of an outlier. With a typical photo-2 scatter of ct ~ 0.04 
it ensures that only 4-(t outliers are counted. 
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global 



local 




Figure 3. Photo-z's vs. spcc-z's for the three different ways of homogenising the PSF. Shown are all objects with secure spec-z's (from 
VVDS, DEEP2, and SDSS) of the 20 fields with VVDS or DEEP2 overlap. No magnitude or ODDS cut is applied. 



local;global;none;T0006 



local;global;none;T0006 




Figure 4. Photo-z statistics as a function of magnitude (left) and redshift (right) showing the effects of different ways of homogenising 
the PSF. While none and T0006 correspond to no PSF homogenisation, global corresponds to a constant Gaussian kernel used for the 
convolution of the image in one band, and local corresponds to a non-Gaussian, spatially varying kernel that leads to the same Gaussian 
PSF over the whole field. The top panel shows the photo-z scatter after outliers were rejected, the middle panel shows the outlier rate, 
and the bottom panel shows the bias (outliers included; positive means photo-z's overestimate the spec-z's). Errors are purely Poissonian. 
Note that the errors between magnitude/redshift bins are correlated. 



5.3 Re-calibration of the photometric zeropoints 

It has been suggested in the literature (e.g. ICoe et al1l2006l : 
lllbert et af]|2006l : ICoupon et"al]l2009l ) that a re-calibration 
of the photometric zeropoints of the images with the help 
of spec-z's can lead to an enhanced accuracy of photo-z's. 
The procedure involves running the photo-z code on a sub- 
sample of objects with reliable spec-z's and just fitting for 
the template while fixing the redshift to the spectroscopic 
value. The averaged magnitude differences in a band be- 
tween the best-fitting templates and the observed photom- 
etry can then be applied as corrections to the zeropoint in 
that band. Usually this is done iteratively until convergence 
is reached. 



We performed such a zeropoint re-calibration for the 
none, global, and local methods in the 20 fields with 
VVDS/DEEP2 coverage. This yields 20 zeropoint correc- 
tions for four of the five bandqj and for each method. The 
absolute value of the mean and the standard deviation of 
the 20 zeropoint offsets are shown in Fig. [51 

From the figure it is clear that both the mean and the 
width of the distributions become smaller going from none 
to global to local. In particular, the corrections for the local 

^ The re-calibration procedure is only sensitive to colours. So we 
decide to fix the offset in the i-band - the detection band - to 
Ai = 0. 
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Figure 5. Same as Fig. |4] but showing the accuracy for the different SED templates using the local photometry. 



method mostly vanish, i.e. most local offsets are similar to 
or smaller than the error of a single correction (« 0.02mag). 

The zeropoints used for the catalogue extraction were 
identical in all three methods. Thus, our results for the ze- 
ropoint re-calibrations strongly suggest that mostly PSF- 
effects are corrected by such a procedure^ Furthermore 
it means that proper PSF homogenisation, in combination 
with an accurate absolute photometric calibration, super- 
sedes zer opoint re-cal i bratio n procedures like the ones pre- 
sented in lllbert et all (|2006l ); ICoupon et all (|2009l ). 

This is confirmed by looking at the photo-z accuracy 
before and after re-calibration. In Fig. [7] the photo-z statis- 
tics are shown again as a function of magnitude and redshift, 
for the none method and a re-calibrated none method. The 
local method is also plotted as a benchmark. The improve- 
ment is striking although the accuracy of the 'none-recalib' 
method does not reach the accuracy of the local method 
at the faintest magnitudes. Interestingly the lines for a re- 
calibrated local method are nearly indistinguishable from 
the basic local method. 

We would like to stress that these findings do not tell 
the whole story and the situation might be even worse for 
re-calibrated photometry. Re-calibrations of the photomet- 
ric zeropoints might well fool the user into believing that 
the photo-2 accuracy is better than it is in reality. If most 



Certainly, there are other effects that play a role here. For 
example, absolute photometric calibrations are often done with 
standard stars that were observed in a slightly different photo- 
metric system than the instrumental one. Conversions between 
the standard system and the instrumental system depend on the 
colour term of the object. Thus, a correction that is correct on 
average for stars is not correct on average anymore for galax- 
ies, which have very different spectral energy distributions. Often 
also the filter-curves used for template-based photo-z's have some 
uncertainty, especially in the ultra-violet where the differential 
atmospheric transparency has a large influence on the effective 
throughput. 
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Figure 6. Absolute values of the means (top) and standard de- 
viations (bottom) of the 20 zeropoint offsets for the different PSF 
homogenisation methods and different bands. 



of the corrections are due to PSF-effects as our results sug- 
gest, then these corrections depend on the average angular 
size of the objects. But the average size of the photomet- 
ric galaxy sample used for the science projects often differs 
from the average size of the spectroscopic calibration sample 
which is not only used for the zeropoint re-calibration but 
also for the following assessment of the photo-z accuracy. 
This circular use is dangerous if the systematic effects that 
are corrected for depend on the nature of the objects (e.g. 
their size) and are not identical for all objects (like e.g. real 
photometric zeropoints). Furthermore, if the zeropoint cor- 
rections depend sensitively on seeing it is not advisable to 
apply the correction found on one particular field to another 
field. 

Based on these findings we decide to use the local 
method for all scientific projects in the CFHTLenS. It offers 
the best photo-z accuracy combined with the most stable 
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Figure 7. Same as Fig. [4] but showing the effects of zero point re-calibration on the none method (i.e. without PSF-homogenisation) 
in comparison to the default local method (i.e. local PSF Gaussianisation) . The solid lines correspond to the original methods whereas 
the dotted lines correspond to the re-calibrated methods. It is clearly visible that the re-calibrated none method performs similar to the 
original (i.e. not re-calibrated) local method, suggesting strongly that zeropoint re-calibrations are mostly correcting for PSF-effects and 
not for real biases in the photometric zeropoints. 



photometry and does not require re-calibration of the ze- 
ropoints to achieve this. The photo-z's for this method are 
well understood in the redshift range 0.1 < Zphot < 1.3 for 
i < 24.5 - two magnitudes f ainter than the analysis pre- 
sented in lCoupon et al.l (|2009l ) - with photo-z scatter values 
in the range 0.03 < a < 0.06 and outlier rates smaller than 
10 per cent. 



5.4 Selection of sub-samples with higher photo-« 
accuracy 

The ODDS parameter introduced bv lBenfted (j2000l ) and de- 
scribed in Sect. 15.11 can be used to select sub-samples of 
galaxies with a higher photo-z accuracy, with the trade-off 
of a decreased completeness and an implicit colour selec- 
tion. In Fig. [8] the photo-z statistics for different cuts on the 
ODDS parameter are shown. Also the completeness of the 
sample is reported when such cuts are applied. 

The main effect of a cut on ODDS is that the outlier 
rates at faint magnitudes are reduced. However, by looking 
at the redshift dependence (right panel of Fig. [8]) it becomes 
clear that these problematic objects are mostly assigned very 
high redshifts of Zphot > 1-5 (i.e. they don't appear in the 
right-hand panel). One exception is a feature at Zphot ~ 0.75 
where the outlier rate can be effectively suppressed by cut- 
ting on ODDS. But overall ODDS has a negligible impact in 
the well-understood redshift range of 0.1 < 2:phot < 1-3. For 
most applications it is probably not meaningful to apply a 
global cut on ODDS, most importantly because such a selec- 
tion always entails an implicit colour selection. A redshift- 
dependent cut on ODDS could make sense for applications 
that could tolerate such a selection (e.g. selection of back- 
ground sources for weak gravitational lensing). 



5.5 Star-galaxy separation 

We separate stars from galaxies using a combination of size, 
magnitude and colour information. Although a pure stellar 
sample can be obtained by isolating the stellar branch in the 
size-magnitude plane, the mixing of faint and small galaxies 
with stars considerably complicates the separation of both 
classes of objects in this regime. For science cases requiring 
a pure and complete galaxy sample, having a robust star- 
galaxy estimator becomes a key issue. 

A great advantage of template-fitting methods to esti- 
mate photo-z's is the ability to use different template sets. 
Therefore, in addition to galaxy templates, one is able to 
test for stellar templates as well. We run BPZ again fix- 
ing the redsh ift to z = and using the stellar spectral li- 
brary from Pi ckles! l|l998f ). For each object, BPZ outputs a 
best-fitting estimate from the stellar library with an associ- 
ated Xstar which can be compared to Xgai associated with 
the best-fitting galaxy template and redshift. Then ideally 
a star would satisfy the relation Xstar < Xgai- However, one 
should keep in mind that both estimators were computed 
from independent template libraries and the comparison is 
therefore not straightforward because of different degrees 
of freedom. To overcome this difficulty, the estimator can 
be tested and calibrated on spectroscopic dat a. The method 
was applied previously on the CFHTLS Wide (|Coupon et al.l 
2009) and tested usin g spectroscopic da ta from VVDS F02 
(■Le Fevre et al.ll2005l ) and VVDS F22 iGarilH et al.ll2008l ) 
surveys. The following criteria, combining size, magnitude 
and colour information, have been found to give the best 
compromise between a pure and a complete galaxy sample 
when tested on the VVDS: 

• for i < 21: all objects with < rh,iimit are flagged as 
stars. 
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• in the range 21 < i < 23, objects with < r^jimit and 
xitar < 2 X xlai are flagged stars, 

• and for i > 23 all objects are flagged as galaxies, 

where is the average half light radius in a field com- 
puted in the i-band image and rh,iimit the 3-a upper limit of 
the rh distribution in a single image. The rh^umit values for 
each field are determined by manually inspecting the size- 
magnitude diagrams 

We present the method efficiency as a function of mag- 
nitude and redshift in Fig[9l True galaxies and true stars are 
given by spectroscopic information. The incompleteness is 
defined as the percentage of galaxies lost after selection com- 
pared to the total number of true galaxies, and the contam- 
ination as the number of true stars misidentified as galaxies 
compared to the total number of true galaxies. A robust 
estimator should lead vanishing numbers for both estima- 
tors. In the range 21 < i < 23, where size, magnitude and 
colour information are used, the estimator performed the 
best, keeping the contamination below 5 per cent and the 
incompleteness below 15 per cent. Extending the estimator 
to brighter magnitude in W4, where the star concentration 
is very high, increases the galaxy incompleteness. The star- 
galaxy separation strongly depends on the redshift estimate 
of the objects. As seen in Sec. 15.21 the redshift estimator 
is the most robust in the range 0.1 < z < 1.3, where both 
incompleteness and contamination remain below 5 per cent 
and 10 per cent, respectively. It seems however that a con- 
servative redshift cut z < 1.1 should be observed if a very 
complete galaxy sample is needed in star-crowded fields, or 
alternatively apply a hard size cut of rh > fhjimit so that 
no star will pass. It should be noted that the high-galactic- 
latitude fields of the CFHTLS contain only very few stars 



so that star-galaxy separation is not a major issue for most 
WL science projects planned by the CFHTLenS team. 

5.6 Redshift distributions 

The redshift distributions of all objects classified as galaxies 
are shown in Fig. [TU] for three different magnitude limits. 
We show the distributions of the most probable photo-z's 
as well as the stacked posterior probabilities output by the 
photo-z code. While there is good agreement between the 
two for i < 23, the double-peaked histogram for the faintest 
magnitude cut (i < 24) is not reproduced in the stacked 
posterior probabilities. This redshift-focussing effect occurs 
when the prior dominates the posterior probability distribu- 
tion for wide, flat likelihoods (plateaus) in the low S/N case. 
A large number of objects are then assigned the peak value 
of the prior which leads to artificial peaks in the redshift 
histograms. This is the reason why we recommend to use 
the full probability distributions, after a proper deconvolu- 
tion taking the photometric uncertainties into account was 
performed, instead of the most probable redshifts (i.e. the 
peak of the posterior) in science analyses^ Similar double- 
peaked ph oto- 2: distributions for i < 24 in the CFHTLS can 
be seen in lllbert et al.l (120061 ) and ICoupon et all (120091 ') so 
we suspect the filter set plays a crucial role here. But also the 
prior can lead to multi-peaked distributions. A final answer 
to this question requires a highly complete spec- 2; catalogue 
all the way down to i = 24. 

There are several methods discussed in the literature 
to correct these redshift probability densities (e.g. iNewmaiil 
2008; Bcniamin ct al. 2010) to make them more realistic 
using angular cross-correlation functions between different 



It should be noted that the estimation becomes less and 
less reliable at faint magnitudes. But for i < 23, where we use it, 
it is still largely unbiased. 



It should be noted that the stacked posterior probabilities 
are certainly affected by the ad-hoc modification of the prior de- 
scribed in Sect. 15.11 
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Figure 9. Star-galaxy separation efficiency as a function of i-band magnitude (left) and photo-z (right). Shown are the fraction of 
stars (frac; top), contamination of the galaxy sample (cont.; middle), and the incompleteness of the galaxy sample (inc.; bottom). 
Our estimator is tested and calibrated using the VVDS spectroscopic samples in Wl (solid line) and W4 (dashed line). Incompleteness 
represents the percentage of galaxies lost after selection, and contamination the percentage of stars misidentified as galaxies. 



photo-2 bins or between photo-z and spec-z samples. We 
defer such an analysis carried out with the CFHTLenS red- 
shifts to a forthcoming paper (Benjamin et al. in prep.). 

We intentionally do not present functional fits to 
these distributions. Contemporary WL surveys (like the 
CFHTLS) have reached such a precision that redshift dis- 
tributions taken from external surveys would dominate the 
total error budget on most cosmological WL measurements 
l|van Waerbeke et al.ll200^ . It is one great advantage of the 
CFHTLS over previous surveys to have a photo-« estimate 
for each galaxy used in the WL analysis. 



6 CONCLUSIONS 

The CFHTLS represents the largest and therefore most pow- 
erful WL survey to date. Here we present new, improved 
photometric catalogues and photo-z which will be used for 
the work of the CFHTLenS team. We show that the correc- 
tion for PSF-efTects in data taken in different bands at differ- 
ent times is crucially important. Not only does the accuracy 
of the resulting photo-z's increase significantly if PSF-effects 
are thoroughly corrected for, but also the overall photomet- 
ric homogeneity of a survey can be improved considerably 
by employing such corrections. We show that this can be 
done up to the point, where re-calibrations of the photo- 
metric zeropoints with sub-samples of galaxies with spec-z's 
become largely unnecessary. 

This implies that the re-calibration of photometric ze- 
ropoints described in the literature are mostly corrections 
for PSF-effects. It is hence dangerous to assume that the 
photo-z accuracy measured on the same spec-z sample that 
was also used for the re-calibration will be matched by the 
photometric sample. On the positive side, these findings sug- 
gest that future surveys like KiDS, DES, LSST, or Euclid 



will benefit tremendously from careful PSF homogenisation. 
These projects will not necessarily need a spectroscopic cov- 
erage over the whole area to achieve their absolute photo- 
metric calibration goals, if PSF-effects are corrected for with 
high precision. For example, Euclid will calibrate its redshift 
distributions directly from very complete spectroscopic cat- 
alogues that will not cover the whole area. Relating the re- 
sults from these calibration fields to the rest of the survey 
requires exquisite photometric homogeneity. 

The CFHTLenS team will use these photo-z catalogues 
for a wide variety of WL-related science projects ranging 
from galaxy-galaxy lensing and cluster lensing to cosmic 
shear tomography using the shear as well as the magnifica- 
tion effect of WL. Especially the cosmological measurements 
will benefit from this very homogeneous photo-z catalogue. 
Measuring tiny correlations over large angular distances, as 
it is done in cosmic shear or cosmic magnification studies, 
requires an exquisite level of control of systematic effects. 
The removal of PSF-effects from the photometry and the 
resulting accurate photo-z's represent an important step to 
reach this goal. 

The shear measurement technique of CFHTLenS will be 
presented in another technical paper (Miller et al. in prep.) 
and the resulting shear catalogue will be carefully inspected 
for systematic effects (Heymans et al. in prep.). Together 
with the photo-z's presented in this technical paper this will 
set the basis for CFHTLenS science analyses. 
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and the sohd histograms represent the distributions of the most probable photo-z values. The whole Wide survey is shown in black, Wl 
in red, W2 in green, W3 in blue, and W4 in cyan. 
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APPENDIX A: DETAILS OF THE PSF 
HOMOGENISATION 

A key part of our analysis is the homogenisation of the 
PSF between images taken through different filters, so that 
proper colours, representing the same part of each source, 
can be measured. Our approach is to convolve the images 
with a kernel that renders the PSF close to Gaussian, with 
a width that is set by the worst-seeing image. 

The PSF Gaussianisation is performed by first con- 
structing a suitable, spatially variable, kernel, and then con- 
volving the images with it. Both steps take advantages of 
som e of the ma thematical properties of the shapelet formal- 
ism (|Refregieil l2003). 

Shapelets are two-dimensional Gauss-Hermite func- 
tions, of the form 

Sab{x,y) = NatHa{x/l3)Hbiy/l3)e~'-'^'"'\ (Al) 

Nab is a normalisation constant, and j3 the scale radius, x 
and y are Cartesian sky coordinates with respect to a suit- 
ably chosen centre; r is the corresponding polar coordinate. 
H„ is a Hermite polynomial. 

Any source with intensity I{x, y) can be written as a 
superposition of shapelets: 

I{x, y) = ^ SabSab[x, y) (A2) 

ab 

where the Sab are the amplitudes of the different shapelets. 

Shapelets have the useful property that a convolution of 
any two of them can be written as a new (generally infinite) 
shapelet series: 
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Sab ® Scd — ^ ^ CaceCbdf St 



(A3) 



and expressions for the m atrix elements Cimn are given in 
iRefregier fc Bacor] l|2003l l. (Note that there is no require- 
ment for the scale radii of the shapelets to be the same.) If 
we express the PSF and the kernel as shapelet series, with 
coefficients Pab and kab respectively, then this allows us to 
write the result of the convolution P K as a, new shapelet 
series with coefficients 



<■<:/ 



^ ^ ^ ^ CaceCbdf Pabkcd = ^ ^ Mc, 



(A4) 



with 

Mcd,ef ~ y CaceCbdfPa 



(A5) 



encoding the effect of PSF convolution in shapelet space. 
If the PSF is known, AI can be computed. Our Gaussiani- 
sation technique involves constructing a shapelet kernel by 
inverting the equation IA4I As target PSF on the left-hand 
side we stipulate a Gaussian; all its elements are zero except 
the too component. 

Before calculating the kernel, we model the PSF varia- 
tion across the full CFHTLS image. This is done by identi- 
fying all stars above a certain S/N ratio, choosing a suitable 
scale radius for the PSF, making a flux-normalised shapelet 
expansion for each star, and then separately fitting (with 
a two-dimensional polynomial) the variation of each coeffi- 
cient Sab across the image, with outlier rejection. 

This PSF map is then sampled on a regular grid across 
the image, the convolution kernel is calculated at each of 
those points, and a polynomial model fitted to the variation 
of its coefficients across the image. This kernel is then con- 
volved with the original CFHT image. Again here we can 
use a nice mathematical property of shapelets: each is its 
own Fourier transform, making the convolution efficient. 

In practice we truncate the shapelet series: we only have 
pixellated information on the PSF and so cannot sample it 
arbitrarily finely. For robustness we truncate the kernel at a 
lower order than the PSF, to prevent overfitting. The inver- 
sion of ea. lA4l is therefore recast as a least-squares problem, 
in which we determine the kcd that best approximate the 
target Gaussian PSF tab- 

For the CFHT data we find that the following param- 
eters work well: Shapelet order (maximum a -I- 6 in the ex- 
pansion eq. IA1|) equal to 10 for the PSF, 8 for the kernel; 
polynomial order for fitting spatial variation across the im- 
age 5; shapelet scale radius of the PSF map 1.3 times the 
median Gaussian radius fitted to the stars; and target Gaus- 
sian radius 0.8 times the largest scale radius of the images 
to be compared. 



